Search results for "Web crawler"
showing 5 items of 5 documents
Mobile Search - Social Network Search Using Mobile Devices
2008
During the last years progress in Web search engines has been made to the point that relevant information can be reached easily most of the time. However very little empirical research has been carried to study Web search in highly dynamic social network environments composed of mobile devices. The aim of this work was therefore to investigate novel approaches that took advantage of the social network environment inherent to mobile peer-to-peer paradigm. The work focused mainly on the development of a prototype for mobile search concept. The prototype was built on top of Drupal content site management system. This study suggests that the methods presented can be a complement to traditional …
A web search methodology for different user typologies
2009
Search engines and directories are the main tools used to find desired information into the ocean of digital contents that is the Web. However, they are not presently able to understand the user specific needs and starting knowledge because their inability to simulate the processes of human mind. Natural Language Processing, Folksonomy, Semantic Web and Serendipitous Surfing are some of the recent research fields towards understanding of human natural language and in general of real user needs. This work aims to add one step more to this evolution path by presenting a new search methodology that allows users to create new knowledge paths on the web based on their specific requirements. Thus…
Online activity traces around a "Boston bomber"
2013
This paper describes traces of user activity around a alleged online social network profile of a Boston Marathon bombing suspect, after the tragedy occurred. The analyzed data, collected with the help of an automatic social media monitoring software, includes the perpetrator's page saved at the time the bombing suspects' names were made public, and the subsequently appearing comments left on that page by other users. The analyses suggest that a timely protection of online media records of a criminal could help prevent a large-scale public spread of communication exchange pertaining to the suspects/criminals' ideas, messages, and connections.
On Utilizing Stochastic Non-linear Fractional Bin Packing to Resolve Distributed Web Crawling
2014
This paper deals with the extremely pertinent problem of web crawling, which is far from trivial considering the magnitude and all-pervasive nature of the World-Wide Web. While numerous AI tools can be used to deal with this task, in this paper we map the problem onto the combinatorially-hard stochastic non-linear fractional knapsack problem, which, in turn, is then solved using Learning Automata (LA). Such LA-based solutions have been recently shown to outperform previous state-of-the-art approaches to resource allocation in Web monitoring. However, the ever growing deployment of distributed systems raises the need for solutions that cope with a distributed setting. In this paper, we prese…
Efficiency Analysis Of Resource Request Patterns In Classification Of Web Robots And Humans
2018
The paper deals with the problem of classification of Web traffic generated by robots and humans on e-commerce websites. Due to the still growing proliferation and specialization of bots, a large body of research into characterization and recognition of their traffic has been conducted so far. In particular, some approaches to classify bot and human sessions on websites have been proposed in the literature. In this paper we verify and discuss the efficiency of such recently proposed approach, which uses differences in resource request patterns of bots and humans. We reconstructed Web sessions from actual HTTP log data for three different e-commerce sites, varying in the traffic intensity an…